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Abstract —We explore the probabilistic foundations of shared 
control in complex dynamic environments. In order to do this, 
we formulate shared control as a random process and describe 
the joint distribution that governs its behavior. For tractability, 
we model the relationships between the operator, autonomy, and 
crowd as an undirected graphical model. Further, we introduce 
an interaction function between the operator and the robot, 
that we call “agreeability”; in combination with the methods 
developed in | [23l . we extend a cooperative collision avoidance 
autonomy to shared control. We therefore quantify the notion 
of simultaneously optimizing over agreeability (between the 
operator and autonomy), and safety and efficiency in crowded 
environments. We show that for a particular form of interaction 
function between the autonomy and the operator, linear blending 
is recovered exactly. Additionally, to recover linear blending, 
unimodal restrictions must be placed on the models describing 
the operator and the autonomy. In turn, these restrictions raise 
questions about the fiexibility and applicability of the linear 
blending framework. Additionally, we present an extension of 
linear blending called “operator biased linear trajectory blend¬ 
ing” (which formalizes some recent approaches in linear blending 
such as f9l) and show that not only is this also a restrictive 
special case of our probabilistic approach, but more importantly, 
is statistically unsound, and thus, mathematically, unsuitable for 
implementation. Instead, we suggest a statistically principled 
approach that guarantees data is used in a consistent manner, 
and show how this alternative approach converges to the full 
probabilistic framework. We conclude by proving that, in general, 
linear blending is suboptimal with respect to the joint metric of 
agreeability, safety, and efficiency. 

I. Introduction 

Fusing human and machine capabilities has been an active 
research topic in computer science for decades. In robotics 
related applications, this line of research is often referred to as 
shared autonomy. While shared autonomy has been addressed 
in toto, it can be broken down into more specialized areas of 
research. In particular, some researchers focus on the fusing of 
human and machine “perception” (see mi Similarly, a great 
deal of research has focused on fusing human and machine 
“decision making” in the machine learning ina, CD, control 
theory ifTTIl . and human robot interaction communities IT^ . 

Under the umbrella of shared decision making, an even 
more focussed line of research has emerged: shared control, 
whereby the moment to moment control commands sent to the 
platform motors are a synthesis of human input and autonomy 
input. Broadly speaking, shared control has been deployed in 
two cases: shared teleoperation (where the human operator 
is not co-located with the robot) and onboard shared control 
(where the human operator is physically on the robot). Shared 
teleoperation is used for numerous applications: search and 
rescue SI and extraterrestrial robotics ^ are two examples. 

Onboard shared control has likewise enjoyed a wide variety 
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Fig. 1: Diagram depicting the relationships of probabilistic shared control. 


of use cases: assistive wheelchair technology ifT^ . assistive au¬ 
tomobile driving, and assistive manufacturing vehicle operation 
(e.g., forklifts) are just a few of the examples. For any of these 
cases, shared control can be broken down into an autonomous 
modeling step, a human modeling step, and a human-machine 
arbitration step (see El for a compelling argument justifying 
this classification scheme). 

In this paper, we explore the probabilistic foundations of 
onboard shared control in the presence of dynamic obstacles 
(“the crowd”): Figure depicts our approach. In order to 
do this, we formulate shared control as a random process 
and describe the joint distribution (over the random operator, 
autonomy, and crowd functions) that governs behavior and 
then propose a tractable model captures notions of operator- 
autonomy agreeability, safety, and efficiency; further, we prove 
that linear blending is a special case of this approach. In 
the course of the proof, we observe the following: first, in 
linear blending, the autonomy reasons independently of the 
operator during optimization, so even if we had a precise 
operator model, the autonomy would not be informed of this 
information until after the optimization. Second, in linear 
blending, the autonomy is limited to a single optimal decision, 
which is then averaged with the operator input—this approach 
leaves no fiexibility in how the autonomy might assist the 
user. Next, we present an extension of linear blending that can 
accommodate more than one statistic about the user (e.g., user 
inferred goal information, preferred trajectories, etc), and find 
that state of the art approaches to this problem are prone to 
statistical inconsistency. We thus present a statistically valid 
model for how to properly condition the autonomy on user 
statistics. We conclude with a section on the optimality (with 
respect to agreeability, safety, and efficiency) of linear blending 
and our probabilistic approach. 
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II. Related Work 

We begin by defining the arbitration step of linear blending: 

uiBit) = Khul + (II.1) 

where, at time t, (t) is the linearly blended shared control 
command sent to the platform actuators, is the human 
operator input (joystick defiections, keyboard inputs, etc.), 
is the next autonomy command, and are the 

operator and autonomy arbitration functions, respectively. To 
ensure that the magnitude of does not exceed the 

magnitude of or we require that + Kji = 1. 

We observe that since = 1 — Kr, there is only one free 
parameter in this formulation. 

This linear arbitration model has enjoyed wide adoption 
in the assistive wheelchair community (Gl, Ezl, ini, (26), 
EqI, CD, ED, ca). Outside of the wheelchair community, 
shared control path planning researchers have widely adopted 
Equation |II^ as a de-facto standard protocol, as extensively 
argued in 191, (81 (in it is argued that “linear policy 
blending can act as a common lens across a wide range of 
literature”). Additionally, the work of (201 . t2M advocates the 
broad adoption of a linear arbitration step for shared control. 

For the purposes of this article, we describe how each 
quantity of Equation |II.1| is computed: 

1) Collect the data at time t: u^.j. are the historical operator 
inputs, are the historical measurements of the state of the 
robot (such as odometry, localization, SLAM output, etc). 

2) Compute the autonomous input This quantity 

may be computed using nearly any off the shelf planning 
algorithm, and is dependent on application. The “Dynamic 
Window Approach” oni and “Vector Field Histograms+” 
f2M are popular approaches to perform obstacle avoidance for 
wheelchairs. Sometimes, the autonomy is biased according to 
data about the operator—for instance, one might imagine an 
offline training phase where the robot is taught “how” to move 
through the space, and then this data could be agglomerated 
using, e.g., inverse optimal control. Alternatively, one might 
bias the autonomous decision making by conditioning the 
planner on the predicted or known human goal. 

3) Compute the arbitration parameters and Kr. A 
wide variety of heuristics have been adopted to compute 
this parameter: confidence in robot trajectory, smoothness, 
mitigating jerk, operator reliability, user desired trajectories, 
safeguarding against unsafe trajectories, etc. Indeed, much of 
the shared control literature is devoted to developing novel 
heuristics to compute this parameter. 

4) Compute the shared control (t) using Equation 


II. 1 


Typically, the data Ui.j. are interpreted literally—no like¬ 
lihood or predictive model filters this data stream. In other 
approaches, operator intention is modeled using a combination 
of dynamic Bayesian networks or Gaussian mixture models. 

For this paper, we adopt the notation zj* = u’l, (that 
is, we treat operator inputs as measurements of the operator 
trajectory, h: f G M ^ A', where A’ is the action space). 
Similarly, we define measurements of the robot trajectory 
f^ and measurements z|.^ of the i’th static or dynamic obstacle 


trajectory f\ We thus work in the space of distributions over 
the operator function h, autonomy function f^, and crowd 
function f = (f^,..., f’^*), measured through z{.^. The integer 
rit is the number of people in the crowd at time t. 

We comment on the work of 0, ina, Q. In these papers, 
the authors construct a probabilistic model over user forward 
trajectories (i.e., a specific and personalized instantiation of 
p{h I They then formulate shared control as a partially 

observable Markov decision process (POMDP), in order to 
capture the effects of robot actions on the probabilistic model 
of the operator. However, for tractability of the POMDP, the 
autonomy is limited to only choosing from the next available 
state—this makes assistive navigation through crowds impos¬ 
sible (as shown in ED). Further, the potential autonomous 
actions are limited by the input device; in one application, the 
autonomy was only able to reason over 9 directions. As we 
discuss in Section |Vl| limiting the autonomy to such a degree 
can negatively impact performance. 


III. Foundations and Implementation of 
Probabilistic Shared Control 

As stated in Section |T| we seek to explore the probabilistic 
foundations of assistive onboard shared control; we thus posit 

(h, U, f)* = arg maxp(h, U, f | z^.t, z{.J. (III.l) 


That is, probabilistic shared control (PSC) is the maximum 
a-posteriori (MAP) value of the joint distribution over the 
operator, autonomy, and crowd. We suggest this approach 
partly based on what we have learned from fully autonomous 
navigation in human crowds (^ —that the interaction model 
between the robot and the human is the most important 
quantity—and partly based on the following: by formulating 
shared control as the MAP value of a joint probability distri¬ 
bution, we can explore the modeling limitations imposed by 
linear blending and the performance consequences of these 
limitations. 

In the remainder of this section, we explain how the 


approach in Equation |III.1| is a natural extension of our 
previous work in (22l . and present a tractable model of the 
joint distribution over the operator, autonomy, and crowd. The 
probabilistic graphical model in Figure [^guides our derivation. 

Definition 1: A cooperative human crowd navigation model 
(as in (22l ) is described by 


f I Zi:t) = f)p(P I zf.i) IIp(r I z\.j.) (III.2) 




where zi-t = [zi?j,z{.^] and p(f-^ | z{?(),p(f® | z^.^) are the 
robot and crowd individual dynamical prediction functions, 
respectively. The robot-crowd interaction function '0(f^,f) 
captures joint collision avoidance: that is, how the robot and 
the crowd cooperatively move around one another so that the 
other can pass. The graphical model is presented on the left 
hand side of Figure 
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Fig. 2: Correspondence between the cooperative model of Equati on |III.2| (left 
hand side of hgure) and probabilistic shared control in Equation |IIL3| 


Definition 2: A probabilistic shared control (PSC) model in 
the presence of static or dynamic obstacles is 

p(h,f^,f I zi,t) = ip{h,{^)p{h I I zi,t) (III.3) 

where p{h \ is the predictive distribution over the 

operator, z^.j. is data generated by the operator, and '0(h, f^) is 
the interaction function between the operator and the robot. In 
analogy with 1^ . the interaction function intends 

to capture how “agreeable” the robot trajectory is with the 
desires of the operator. In this article, we choose 7/;(h, f^) = 
exp(—^(h—f^)(h—f^)^), where 7 captures how tightly the 
autonomy must couple to the operator; it is important to point 
out that many other reasonable functions could be used here. 


Importantly, as shown in Theorem |4.1[ this is the (implicit) 
operator-robot interaction function of linear blending. The 
graphical model is presented on the right hand side of Figure]^ 

The model in Equation |III.3| can be understood in the 
following way: the autonomy (modeled with p(f^,f | zi^t)) 
optimizes over safety and efficiency in crowds—specifically, 
the robot-crowd interaction function 7/;(f^,f) models how to 
move through a crowd in the safest and most efficient way. 
Similarly, the function 7/;(h, f^) models how “agreeable” the 
robot path f^ is to the operator path h. By formulating h and 
f ^ as random functions, fiexibility of both parties is modeled, 
thus enabling a more realistic arbitration (where both parties 
are willing to compromise). This idea was captured in II2TI : the 
robot and the crowd were treated as random functions so that 
they could come to a compromise about how to (cooperatively) 
pass by one another. 

Thus, by choosing the joint argmaxj^ fH f as in Equa¬ 
tion |III.1| we simultaneously optimize over three quantities: 


agreeability, safety, and efficiency. We suggest that such a 
joint optimization over safety and efficiency (the domain of 
the autonomy) and agreeability (adhering to the desires of the 
operator) is a reasonable way to formulate shared control. Our 
analysis in the following sections provides credibility to this 
presumption. 

IV. Linear Blending as Probabilistic Shared 
Control 

To understand linear blending as a special case of Equa¬ 
tion [IIO we consider the following conditioning relationship: 

p(h, f^, f I zi:t) = p(f^, f I zf.j, h)p{h I z^.j). (IV.l) 


To understand how the linear blending modeling assumptions 
effect the full joint, we first insert the linear blending operator 
model: p(h | z^.^) = 6(\\ — z^). Thus, 

p(h, f^, f I zi:() = p(f^, f I zL, z(!(, h)(5(h - z’l) 


:p(f^f 


J 


7 ^ 7 ^ 


)• 


In this case. 


argmaxp(h, f^, f | zi:t) = argmaxp(f^,f | z 
h,f^,f f«,f 


/ yR 


Because this distribution is already conditioned on z^, there 
is no ne ed f or a linear arbitration step (we prove this in 
Theorem 4.1). Eurther, by fixing the operator at z^, we are no 


longer jointly optimizing over the autonomy and the operator. 

Theorem 4.1 (Equation \IIL 1 \ generalizes linear blending): 
Let p{f^,i I zC,z{'j,z'*) = \ zi:t) 

and use Laplace’s Approximation O to approximate 
p(f^,f I zi:t), where f^t+i is a mode of the distribution: 

p(f^,f I Zi,t) I 

Then u^pscit) = = (7“^ + 

and 7 is the operator-autonomy attraction parameter. 

Proof: We compute 




'>^'psc{t) = argmaxp(h,f^,f | zi-t) 
(X argmaxA/’(f^^i 
= argmaxA/’(f^^i 

fR 

L+i 


R 

t+1 


fX,(T) 


(IV.2) 


R\-l\ 


where fx = cr{^z^ + and cr~^ = (7“^ + (cr^) 

To make the correspondence with linear blending explicit, 
we choose = ^/Kr — 7 and 7 = cr^/Kh — cr^. Then 


.R 


cr(-zj* + = — 7 —4 + 

7 CT^ CT^ + 7 


7 


rR . 


.fR 


= Khz^ + Kpf^. 


We recall that + Kr = 1 and so the mixing of ctr and 7 
is expected. Only one free parameter exists in linear blending. 

We now examine the how the assumptions of linear blend¬ 
ing can effect performance. The linear blending restriction of 
the robot-crowd distribution to a unimodal Gaussian during 
arbitration p(f^,f | zf.^z{.j) = \ can 

lead to severe restrictions in shared control c^ability; even 
though a locally optimal autonomy strategy f^ is included 
in the linear blend = KhZ^ + KR^^t-\-l, there is no 
guarantee that the human operator will choose this optima, 
or even a nearby optimum (see Section [Vl|. Alternatively, by 
maintaining the multitude of hypotheses inherent to p(f^,f | 
zi:t), we greatly increase the possibility that the autonomy will 
be able to assist the operator in a way that is both desirable 
and safe. As an example, linear trajectory blending over a safe 
operator input and a safe autonomous input can result in an 
unsafe shared trajectory. That is, the weighted average of two 
safe trajectories can be averaged into an unsafe trajectory. 
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V. Conditional Trajectory Blending 

In this section, we extend our definition of linear blending to 
capture the salient characteristics of recent approaches such 
as El. Broadly speaking, this line of work seeks to include 
additional data about the operator before arbitration. We show 
that this is statistically unfounded, and thus introduce a method 
(conditional trajectory blending) that incorporates operator 
information in a statistically principled way. 

Definition 3 (Extending Equation Let 

1) h ^ p(h I z^.^) be defined through time T > t, such 

that hi [ 1,T] (Z M i —y , 

2) f^ ^ p(f^, f I z{.^), with f^: [1, T] C M i-A A', 

3) be the shared control : [1, T] C M ^ A', 

4) and K^^Kr be the arbitration functions. 

Definition 4 (LTB): Linear Trajectory Blending (LTB) is 
defined as 1) Sample f^ = argmaxfj? f p(f^, f | 

2) Sample h = argmaXhP(h | z^^), and 3) Construct the 
shared control u^rrr = Kfii + KR^^. 

By extending to trajectories, we can more easily incorporate 
operator information into the arbitration step. In this vein, 
we now define an extension of linear trajectory blending 
that biases the autonomous decision making on operator data 
(motivated by the approach in 191). 

Definition 5 (LTBo): Let p(G | z^) be a distribution about 
the operator, where z^ C z^.^. Then operator biased linear 
trajectory blending (LTBo) is defined as 

1) Sample Gi ^ p{G \ zfi). Gi could be a trajectory, a 
waypoint, a goal, or any other relevant quantity. 

2) Sample the operator biased distribution 

f^h = argmaxp(f^,f | zf.^, Gi). 

f^,f 

3) Sample the operator trajectory h = argmaxj^p(h | zj^.^). 

4) Construct the shared control ~ 

With these definitions, we extend the approach of Equation |I lT] 
to include a model of the operator p(h | and enable 

seeding the autonomy with operator information. By modeling 
the operator with a distribution, we potentially bypass the issue 
of noisy inputs leading to “jittery” linear blends. By seeding 
the autonomy with operator statistics, we might drive 
towards solutions more closely aligned with user desire. 

While this approach is sensible, no principles motivate such 
an extension. We thus pause to examine this approach in 
the context of the full joint distribution. In particular, recall 
Equation |lV.lt if we only sample the single operator statistic 
h = argmaxj^p(h | then we are implicitly making the 

assumption that p{h \ z^.f) = J(h — h); thus we have 

p(h,f^,f I Zi:() =p(f^,f I zL,z(*(,h). 

In this case, 

argmaxp(h,f^,f | zi-.t) = argmaxp(f^, f | zL,zfj,h), 

and so (as before) there is no need for a linear arbitration 
step—we have already conditioned the autonomy on the 
operator, and we can recover linear blending by choosing 


p(f^, f I z{.^, z^^, h) as in Theorem 4.1 If we have a separate 
model p{G \ z^) about the operator, we know that it does not 
contain information beyond what is available in p(h | 


since z' 




(see the discussion on the “data processing 
inequality” in [?]); worse, p{G \ z^) can lead to statistical 
inconsistencies (see Lemma [5^ . We now show how to resolve 
this inconsistency and how to combine multiple operator data 
points in a statistically sound manner. 

Definition 6 (Conditional Trajectory Blending): 

Assume that we have the distributions p{h \ z\.f) and 
p(f^,f I zZ,zj^^,hb). Then let ~ p{h \ z^*.^ be a 

collection of samples of the operator model. If we take the 
model of the operator to be p{h \ z^.J = — h^), 

then 

Nh 

p{h, f^, f I zi:0 = p(f^, f I zL, z(^„ h) ^ - h'’) 


b=l 


Nh 


= |zZ,z(^„h'’) (V.l) 


6=1 


where = p{h^ = h | z^. f) is the probability of sample h^. 
We interpret the shared control to be 


Nh 

UcTB = argmax | 

b=i 

In particular, we revisit the case of LTBo: suppose that we 
sample Gi ^ p{h \ z\.f) — Gi is present in this distribution 
since it contains all the data—and then sample h P(h I zZ). 
Then conditional trajectory blending tells us that we should 
find the arg max of the distribution 


Uctb — 

f^,f 


w^p{^^, f 




Gi) + 




In this case, then, '^ltbo'^ however, since conditional 

trajectory blending is derived directly from the full joint we 
know that it is combining the data Gi and h in a statistically 
sound manner. The next theorem provides information about 
the limiting behavior of conditional trajectory blending. 

Theorem 5.1 (CTB approximates PSC): As the number of 
operator samples tends to infinity, probabilistic shared control 
(Equation |IIL1| ) is recovered. 

Proof: Representing p{h \ z^. f) = — h^), 

p(h,f^,f I zi.,t) =p(f^,f I z(*(,zZ,h)p(h I zZ) 

CX3 

= P(f^. f I zL. Zi:t> h) 

6=1 

oo 

= 52 w'’p{f^, f I z(*(, zZ, h''). 


6=1 


It is important to emphasize that conditioning the autonomy 
on operator statistics and then averaging f^h with a separate 
operator statistic h is not just unnecessary, but potentially 
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statistically unsound as well, since it is unclear how such an 
approach originates from Equation IV. 1| 

Lemma 5.2 (LTBo statistically unsound): LTBo is not 
guaranteed to incorporate data in a statistically sound manner. 

Proof: Suppose that one were to sample h ^ p{G \ z^), 
then sample h = argmaXhP_(h | then compute 

and then compute + Since we have incorporated 

h twice in the linear blend, the data has been overused. One 
could potentially compensate for this by “removing” the effect 
of double usage of h in but it is unclear how to do this 
in a statistically sound manner. 

Even if h is sampled twice in CTB, it is only counted once: 
UcTB = argmaXfH^f [2w^p{i^J \ z(.j, h)]. ■ 

Thus, if we have a model of \ z(^(, z{.j, h^) and a 

model of the operator p{h \ z^.^), we have a clear mandate for 
how to correctly formulate shared control. 


VI. Optimality of Shared Control 

We start with the following Gaussian sum approximations: 
p(h I zL) = I andp(f^,f | zi,t) = 

J2n=i I /Lt„, S„). Then we have that 

iphih,f^)p{h I zE)p(f^,f I Zi:t) 

Nh Nr 

= f^) y] am^f{h I Yim) y] I S„), 

m=l n=l 

(VI. 1) 

and as Nh^Nn ^ oo, we recover the densities p(h | z^.^) 
and p(f^,f I zi-t )—see ID We note that the approximation 
/^nA/’(f^ I consists of Only safe modes, since 

f) will assign near zero (3 to any unsafe modes. This is 
not true for the operator Gaussian mixture, since the operator 
may make choices that place the platform on a collision course. 


Theorem 6.1 (LTB suboptimal): If either p(h | z^.^) or 
p(f^,f I zi-t) is multimodal, then is suboptimal with 

respect to agreeability, safety, and efficiency. 

Proof: Let h = argmaxj^p(h | an d f^ = 

argmaxfi? fp(f^,f | zi-t). Then we write Equation 


VI. 1 


as 


Nh Nr 

^^(h, f^) y^ amJ\f{h I Ym) y^ \ ^n) 

m=l n=l 

= f^) (a*(5(h - h)rV(f^ I f«, S*)+ 

Nh Nr 

where we extracted the largest mixture components, and 
rewrote the summation using the notation m 7 ^ * to indicate 
that all the modes except the largest are present. If p(h | z\.j) 
and p(f^,f I zi:t) are unimodal, then we recover linear 
blending as the optimally agreeable, safe, and efficient solution 
(Theorem |4.1[ ). In general, however, the argmaxj^ fH f does 
not correspond to the linear blend. 


We point out that this result is independent of how 7 /;(h, f^) 
is chosen. In other words, for any definition of “agreeability”, 
the suboptimality of linear blending still holds. ■ 





Goal 

A/'(f^ I f^*,Ei) 


: Obstacles 


Start 


Fig. 3: One global autonomy optima at and a safe but suboptimal 
autonomy mode at through some obstacle field (additional autonomous 
modes exist but we leave them off for clarity). The operator’s unimodal 
predicted trajectory at h is safe. Covariance functions removed for clarity. 


To understand why this is important, first consider the 
illustration in Eigurej^ Suppose that there are two autonomous 
safe modes through the obstacle field: A/'(f^ | f^*,5]i) and 
V(f«|Aif,S 2 ).Thus 

■*/)^(h,f^)p(h I zEMf^,f I Zi:t) 

= I +/32V(f'' I Aif,S2)] 

= |^V(f'' I + ^V(f'' I Aif,S2) 

Zi Zj2 

where pi > P 2 (the first mode is the global optima). However 
T cx exp (^-1 (h - f^*)^ (7 + Si )-1 (h - f^*)) 

and so 1 /Zi is exponentially smaller than I/Z 2 since h —f^* 
is much larger than h. — 112 • Thus, the probabilistic shared 
control in this situation is very close to both and h. 

Conversely, Here, if Kh is 

close to then is unsafe. If Kr ^ then the 

autonomy overrides the operator’s (safe) choice. If ^ Kr, 
then the operator is controlling the platform, and is thus not 
being assisted. This begs the following question: what heuristic 
should be employed to choose Kh and KrI By carrying 
multiple modes (as in probabilistic shared control), we bypass 
this dilemma, since heuristics are never invoked. Basic rules of 
probability theory (namely, the normalizing factor) determine 
the best choice. In other words, probabilistic shared control 
is able to determine the shared control in a data driven way 
rather than through anecdote. 

In Eigure the operator chooses an unsafe trajectory. Eor 
linear trajectory blending, Urrr = Khhf-KR^^*, and so we 
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Fig. 4: One global autonomy optima at f^* and a safe but suboptimal 
autonomy mode at through some obstacle field (additional autonomous 
modes exist but we leave them off for clarity). The operator’s unimodal 
predicted trajectory at h is unsafe. Covariance functions removed for clarity. 



Fig. 5: One global autonomy optima at and two safe but suboptimal 
autonomy modes at 1 X 2 and through some obstacle field. The operator’s 
bimodal predicted trajectory centered at h and Covariance functions 
removed for clarity. 


must choose our heuristics such that ^ in order to 
avoid collision—that is, we must insert logic that overrides the 
operator when the operator makes unsafe decisions. 

In contrast, consider 

V’/t(h,f^)p(h I zj.jp(f^,f I Zi:t) = V;h(h,f^)p(h I z^'.Jx 

[/3iV(f^ I f^*, Si) + /? 2 V(f^ I S 2 )] 

[^iV(f« I Si) + ^2V(f« I Aif, S 2 )] 

where we have maintained both autonomous modes and a 
unimodal distribution over h. In this situation is close to 
P 2 , and the difference between the operator mean and either of 
the autonomy means are nearly the same, so Zi is close to Z 2 . 
However, both Ei and I ]2 are both fairly narrow (otherwise, 
they are not safe modes), and because there is flexibility in 
p{h I the MAP value is close to either f^* or 

Because the operator is treated probabilistically, heuristics are 
not used to detect poor operator choices: the autonomy assists 
the operator by blending near the tails of p(h | z^.j). 

In Figure the operator generates ambiguous data about 
how he wishes to move between the start and the goal, 
and so p{h \ z^.^) is bimodal. Linear blending produces 
u^ltb — and so we end up with a situation 

very similar to that discussed in Figure |3] —an autonomy and 
an operator that are needlessly in disagreement, and thus 
difficult to disambiguate with Kh and Kr. To be fair, if h and 
f^* happen to lie close to one another, then linear blending 
provides the optimal solution. But for multimodal autonomous 
and operator distributions, such a situation is the exception 
rather than the rule (this exception requiring that the operator 
make globally optimal decisions). 

For probabilistic shared control, the shared control is likely 
a trajectory near h and —a solution that is both safe and 
respects the operator’s desires (although, depending on the 
weights am and /3n the solution may be near 


a solution that reflects the operator’s desires and is still safe). 

These flgures illustrate Theorem 6.1 that linear blending is 
suboptimal when either the operator or the autonomy model 
is multimodal; anecdotally, this is a result of restricting the 
blend to a single autonomous decision and a single operator 
trajectory, which can be in conflict when nontrivial modes are 
not reasoned over. 


VII. Conclusions 

We presented a probabilistic formalism for shared control, 
and showed how the state of the art in shared control, linear 
blending, and a trajectory based generalization, are special 
cases of our probabilistic approach. Further, we showed that 
linear blending is prone to statistical inconsistencies, but that 
the probabilistic approach is not. Finally, we proved that linear 
blending is suboptimal with respect to agreeability, safety and 
efficiency. Importantly, no experiments were performed, and 
so questions remain as to the viability of this approach. 
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